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The DNA packaging motors of dou- 
ble-stranded DNA phages are mod- 
els for analysis of all multi-molecular 
motors and for analysis of several fun- 
damental aspects of biology, including 
early evolution, relationship of in vivo 
to in vitro biochemistry and targets for 
anti-virals. Work on phage DNA packag- 
ing motors both has produced and is pro- 
ducing dualities in the interpretation of 
data obtained by use of both traditional 
techniques and the more recently devel- 
oped procedures of single-molecule anal- 
ysis. The dualities include (1) reductive 
vs. accretive evolution, (2) rotation vs. 
stasis of sub-assemblies of the motor, 
(3) thermal ratcheting vs. power strok- 
ing in generating force, (4) complete 
motor vs. spark plug role for the packag- 
ing ATPase, (5) use of previously isolated 
vs. new intermediates for analysis of the 
intermediate states of the motor and 
(6) a motor with one cycle vs. a motor 
with two cycles. We provide background 
for these dualities, some of which are 
under-emphasized in the literature. We 
suggest directions for future research. 

DNA Packaging 
and Other Biological Motors: 
A Brief Summary 

Linear biological motors execute chemi- 
cal energy-dependent cycles that cause 
motion along a pre-assembled track. In 
the case of intracellular transport along 
a microtubule track, all known motor 
activity is in the globular region of each 
member of either a kinesin or a dynein 
dimer that walks along a microtubule. 
Each globular region has ATP cleaving 
activity (reviewed in refs. 1-3). In the case 



of muscle-contraction-associated move- 
ment along an actin filament track, all 
known motor activity is embedded in a 
myosin multimer, each member of which 
also has an ATP-cleaving globular seg- 
ment (reviewed in refs. 4—6). In the case 
of motion along a DNA track by a virus 
DNA packaging motor, all motor activ- 
ity is generally assumed to be located in 
the ATP-binding and neighboring regions 
of DNA packaging ATPase molecules 
assembled in a multimeric ring, as illus- 
trated in Figure 1 (recently reviewed in 
refs. 7-9). 

The term, motor, is used for these 
assemblies, in part, because increase in 
directionally biased motion (increase in 
mechanical energy) is produced via forces 
originating in breakage of covalent bonds 
(decrease in chemical energy). However, 
unlike many more familiar motors, bio- 
logical motors function, for practical pur- 
poses, at one temperature and do not use 
temperature differences to generate force. 

The term, motor, is also used because 
the chemistry involved in generating 
motion is cyclical in character. For exam- 
ple, non-random motion generated by 
a gasoline fire is not considered motor- 
derived unless the fire occurs cyclically, in 
an internal combustion engine, for exam- 
ple. Biochemistry can be made cyclical, 
without coordination by a multi-molec- 
ular complex. 1213 But, no evidence exists 
for a mechanism of this type within the 
cycle of a biological motor, to the authors' 
knowledge. The alternative is cyclical bio- 
chemistry that is generated by an if-then 
program embedded in assembled proteins, 
with the program derived via selection for 
genes that encode motor components and 
accessories. The job at hand is to learn 



www.landesbioscience.com 



Bacteriophage 



239 




Capsid I 



B fj^^jl Capsid 



Capsid 
gp10 



Scaffolding Connector Cylinder 
gp9 gp8 gp14/15/16 



dsDNA 



n Packaging ATPase 
9P19 

O DNA binding 
gp18 



DNA 




Connector 



Crown 



Clip 

Packaging 
ATPase 




C-terminal 
segment 



Tunnel 
loop 
segment 



Figure 1. DNA packaging of the related phages, T3 and T7. (A) The procapsid (capsid I) is shown after initiating packaging of a DNA molecule. (B) 
The conversion product of the procapsid (capsid II) is shown packaging DNA at a later stage. 711 (C) The connector (yellow)-terminase (green) motor 
complex is shown. (D) This motor complex is shown in more detail. The legend indicates the color-coding of the various components of the motor 
complex. The mature phage has a capsid II- like capsid, with a mature DNA molecule and a tail that is shown in Figure 3. 



what the program is at the level of the 
assembled motor. 

Learning the program for phage DNA 
packaging motors has been a priority 
for the following reasons. First, double- 
stranded DNA phages have common 
ancestry with some eukaryotic double- 
stranded DNA viruses (herpes viruses, 
for example) and archaeal viruses, based 
on sequence similarity. 14 " 16 This common 
ancestry implies origin of the proteins 
and, presumably the "program," at least 
1.6 billion years ago. 17,18 Thus, by study- 
ing phage DNA packaging motors, we are 
also studying ancient ancestry. Possibly, 
the various cellular motors evolved from 
phage DNA packaging motors. Some 
analogy exists, for example, between ABC 
transporters and phage DNA packaging 
motors. 15 The less analogous microtubule- 
and actin-motors potentially diverged 
at a higher rate, after forming a separate 
evolutionary branch. Tubulin does have 
archaeal ancestry, based on sequence 
homology. 20 

Second, phages are optimal for both 
biochemistry- and genetics-based analy- 
sis because of their short life cycle and 
relatively low expense of propagation. The 



related phages, T3 and T7, have a growth 
phase that lasts 13-15 min at 37°C and 
25-30 min at 30°C. Plaques form within 
2.5 h at 37°C. One can do two to three 
interdependent, plaque-based experi- 
ments per day. Other phages reproduce 
more slowly, but typically not more than 
3x more slowly. 21,22 Thus, phages provide 
model biological systems that are poten- 
tially more accessible to analysis than the 
systems of eukaryotes and their viruses. 

Third, in the case of phages, one has 
the best opportunity to obtain a series of 
biochemical/biophysical "snapshots" of a 
motor as it functions in cells (in vivo). For 
a DNA packaging motor, one does this by 
fractionating particles originally produced 
during DNA packaging in vivo; these par- 
ticles will be called intermediates, whether 
or not altered by fractionation. At this 
level, fractionation of intermediates has 
not been a viable strategy for most cellular 
motors, including those discussed above. 
Actin- and microtubule-based motors 
are studied in vitro, after reconstruction 
from purified components (reviewed in 
refs. 1-6). A strategy based on in vitro 
function is also often used with phage 
DNA packaging motors (reviewed in 



refs. 7—11). The fractionation of in vivo- 
generated intermediates provides access to 
features of the motor that are dependent 
on intracellular conditions that are not 
mimicked in in vitro systems. 

Fourth, detailed structures of phage 
DNA packaging intermediates are now 
becoming available via cryo-electron 
microscopy with 3D reconstruction (cryo- 
EM; see Duality #2 and Duality #5). 
Determining these structures adds analy- 
sis of dynamics to the previous analysis of 
viral structure. Analysis of dynamics is the 
topic of primary interest because it encom- 
passes most of the physical chemistry and 
biochemistry. 

Fifth, as spatial resolution increases for 
cryo-EM-determined structures of DNA 
packaging intermediates, possible targets 
for anti-viral compounds will emerge. 
One reasonably anticipates that the most 
vulnerable targets will be at the most 
motor-active sites. 

Some Motor Basics 

Understanding the thermal motion of 
motor components is part of the foun- 
dation for understanding the cycles of 
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biotic motors. The first principle is that 
no motions violate the second law of ther- 
modynamics. This principle has ramifi- 
cations throughout the fields of physics, 
chemistry and biology (Sup. Material, 
Section 1). 

Nonetheless, thermal motion can be 
the sole source of biotic motor-generated 
directional motion, if thermal motion is 
inhibited to an extent that differs with 
direction of motion (Sup. Material, 
Section 2). An empirically confirmed 
example exists in the case of the tran- 
scription-associated motion of phage T7 
RNA polymerase. 23 ' 24 Such differential 
inhibition (rectification) requires force, 
which is potentially derived from changes 
in covalent bonds, such as cleavage of 
ATP. Rectification of thermal motion is 
similar to rectification that occurs when a 
mechanical ratchet is driven in two direc- 
tions, but moves a load in only one direc- 
tion. Thus, when rectification of thermal 
motion is the sole source of motion gen- 
erated by a biotic motor, the term, ther- 
mal ratchet, is used. The point has been 
made that impure thermal ratchets exist, 
in which thermal ratcheting and other 
mechanisms for generating motion are 
both involved. 1 Thus, we will use the 
term, "pure thermal ratchet," when other 
mechanisms are insignificant. 

In 1957, Huxley proposed that myo- 
sin/actin motors worked as pure thermal 
ratchets. Thermal motion of a part of 
myosin was directionally "fixed" via ATP 
cleavage and then the rest of a myosin 
molecule was translated via tension cre- 
ated during the previous thermal motion. 
Actin asymmetry generated the specific- 
ity. 25 One of the authors (PS) had origi- 
nally proposed a mechanism for DNA 
packaging motors that had a thermal 
ratchet-like DNA binding event, but was 
driven more by an osmotic pressure dif- 
ferential than by thermal motion. 26 At 
the time, PS was not aware of the earlier 
proposal for myosin/actin motors. The 
data now show that neither myosin/actin 
motors nor ATPase-dependent phage 
DNA packaging motors work as either 
a pure thermal ratchet or its osmotically 
driven counterpart, as reviewed for myo- 
sin-, kinesin- and dynein-driven motors in 
references 1 and 27. We discuss the case 
DNA of packaging motors in Duality #3. 



Integration of 
Some Basics with Analysis 
of Biological Systems 

After dissociation and reassembly in vitro, 
tubulin- and actin-based motors have some 
technical advantages that have resulted in 
advances in some areas beyond what has 
been done with phage DNA packaging 
motors. Some of these advances are basic. 
Thus, we begin with them. The discussion 
of this section is based on the discussion 
of reference, 1 an especially well organized 
review. In the interest of simplicity of pre- 
sentation, we make no attempt here (or in 
subsequent sections) to be comprehensive 
in referencing. 

First, we introduce a second physical 
principle for biological motors: A pure 
thermal ratchet becomes more effective 
as the distance of ratchet-driving ther- 
mal motion becomes shorter. This prin- 
ciple is derived, semi-quantitatively, from 
Equation 1, the basic equation of thermal 
motion (diffusion). 28,29 

< AX 2 > = 2Dt (1) 

D is the diffusion coefficient and AX 
is the thermal motion-derived change 
in distance along the x-axis in a time, t. 
Therefore, the mean velocity (v) of diffu- 
sion-based displacement is approximately 
(< AX 2 >) 1/2 /t = 2D/(< AX 2 >) 1/2 . If F p is 
the apparent force applied by a pure ther- 
mal ratchet to overcome frictional resis- 
tance in a medium of viscosity, T|, with a 
load approximated by a sphere of radius, 

F_ F = v(6Trn# E ) = 12D(ttt\R e )/(< AX 2 >) 1/2 (2) 

That is to say, the greater the 
ratchet-driving diffusion distance [i.e., 
(< AX 2 >)" 2 ], the less the force generated 
and the less effective is a pure thermal 
ratchet. 

If a pure thermal ratchet works against 
non-friction-based force of absolute value, 
E L (E L is negative), then the motion-pro- 
ducing force decreases; v decreases and 
Equation 2 becomes: 

F_ F +F_ L = 12Z)(Trn# E )/(< AX 2 >)" 2 
or 

v = 2D/(< AX 2 >) 1/2 - F l /(6ttj]R e ) (3) 



As F L increases at constant < AX 2 >, 
eventually v becomes zero; the relationship 
between v and F_ L is a test for whether or 
not a pure thermal ratchet is in operation. 
If the motor applies a force of its own (of 
absolute value, F p ), then F L in Equation 3 
is replaced by E L -F p . The result of applying 
a F p is typically called a power stroke when 
F p occurs in quantized fashion, which it 
usually does in the systems that have been 
studied. 

Tubulin- and actin-based motors both 
have a power stroke in vitro. Investigators 
detected power stroke-generated, quan- 
tized motion by attaching a bead to the 
motor protein and using light microscopy 
to quantify bead progression along a track 
(Sup. Material, Section 3). Quantized 
motion means that periods of no motion 
are interspersed with periods in which 
distance along a track changes in a time 
short in comparison to the time of a cycle, 
thereby creating steps in a plot of distance 
vs. time. We will use the word, nanome- 
try, for procedures of the above type, when 
used for detecting and quantifying the 
motions of a biotic motor. Both the exis- 
tence of steps and various analyses based 
on equations related to Equation 3 imply 
that a power stroke exists for tubulin- and 
actin-based motors. 1 " 3,27 

In work on tubulin- and actin-based 
motors, the only known systematic 
change between power strokes is the 
cleavage of ATP to produce ADP and 
phosphate. No systematic change occurs 
in the load on the motor, although load 
fluctuations can occur. The ATP cleavage- 
derived changes initially have negligible 
effect and measurements can be made 
such that changes in the concentration of 
ATP and cleavage products are taken into 
account. When that is done, measure- 
ment of (1) consumption of ATP per time, 
(2) number of motors and (3) number of 
steps per motor per time yields the number 
of ATP molecules cleaved per step. The 
result is 1 ATP cleavage for each step. 1 " 3,27 
The constancy of conditions justifies com- 
puting averages that result in the conclu- 
sion that one power stroke is accompanied 
by cleavage of 1 ATP molecule. 

If constancy of conditions does not 
exist, the computing of averages is not 
likely to provide an accurate version of 
ATP utilization, unless ATP utilization 
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per motor step vs. time is known. In all 
calculations of this type, one ignores the 
fact that the cycle has thermal motion- 
derived and possibly other perturbations; 
no two iterations of the cycle can be 
exactly the same. 

Low-Resolution Structure 
and Dynamics of Phage 
DNA Packaging Motors 

A phage (or eukaryotic virus) DNA pack- 
aging motor induces directionally biased 
motion of a DNA molecule through a 
DNA-encircling ring. The ring is attached 
to a phage capsid. DNA motion through 
the ring causes packaging of the DNA 
molecule in a cavity of the protein shell of 
the phage capsid. DNA packaging starts 
with a capsid (called a procapsid) that is 
assembled in the absence of DNA. The 
T3/T7 procapsid (also called capsid I) is 
illustrated in Figure 1A. The procapsid 
initiates packaging, but undergoes several 
changes that occur early in packaging. 
These changes usually (not always) include 
increase in radius, increase in angularity, 
decrease in the magnitude of the average 
electrical surface charge and loss of inter- 
nal protein that served as a scaffold for 
shell assembly (reviewed in refs. 7-11), as 
illustrated for T3/T7 in Figure IB. 

The shells vary in both shape and com- 
position among different phages and also 
eukaryotic viruses. The simplest shells are 
icosahedrally symmetric arrays of a single 
protein (for example, Fig. 1). Variations 
on this theme include (1) additional bands 
of major shell protein subunits between 
two icosahedral hemispheres, so that 
the shell resembles an ellipsoid, (2) shell 
decoration proteins, the subunits of which 
fit into the lattice of major shell protein 
and (3) major shell protein substitutes that 
form pentamers at 5-fold axes (reviewed in 
refs. 7, 8, 30 and 31). To keep the focus on 
basic principles, we will assume the simple 
shell of Figure 1, unless data for a decora- 
tion protein appear to add to the basics of 
DNA packaging. 

All studied viral procapsids have a 
DNA-encircling ring component that, in 
the case of phages, is called either the por- 
tal ring or the connector. In all cases, the 
connector is a 12-membered ring of one 
protein. The term, connector, originates 



from the finding that, in the mature 
phage particle, this ring also connects the 
shell to an external appendage (tail). The 
tail attaches the phage to its host cell just 
before the packaged genome is injected 
into the cell to start an infection. The 
(12-fold) connector is located at an axis 
of 5-fold symmetry and replaces five shell 
protein monomers (reviewed in refs. 7—9 
and 30-32). The connector is illustrated 
for the related phages, T3 and T7, in 
Figure 1. The various connectors have 
similarity at the level of tertiary struc- 
ture, although the underlying sequence 
similarity is often too distant to detect, 
thus far. 30-32 Based on the similarity of 
structure, we will make the conventional 
assumption that the connector's role in 
DNA packaging is the same for all studied 
double-stranded DNA phages. 

For activity, all studied viral DNA 
packaging motors also require a ring- 
assembled protein that is attached to the 
external side of the connector. The num- 
ber of subunits is such that a symmetry 
mismatch usually occurs with the con- 
nector. 7 " 11,33,34 In all known cases, this 
protein has an N-terminal segment that 
has been demonstrated either biochemi- 
cally 34,35 or informatically 35,36 (see also 
Duality #1) to be an ATPase. This pro- 
tein is called the DNA packaging ATPase. 
Unlike other well-studied phages, phage 
phi29 has a small RNA (pRNA) that 
accompanies the packaging ATPase in 
equimolar amount. 37,38 Cryo-EM, with 
conventional asymmetric 3D recon- 
struction (not designed to be sensitive 
to tandem, uncorrelated symmetry mis- 
matches), reveals 5 -fold symmetry for the 
phi29 ATPase/pRNA ring (reviewed in 
ref. 39). Nonetheless, the pRNA ring can 
also be assembled from covalently linked 
pRNA dimers, which suggests that it and 
the ATPase can, in some situations, form 
a 6-membered ring (reviewed in ref. 38). 

Phage phi29 is also unusual in that 
the DNA substrate for packaging is the 
same as the DNA that was injected at 
the start of infection. 37 " 3 ' Most studied 
double-stranded DNA phages, including 
T3/T7, package an end-to-end multimer 
of mature genomes (concatemer) that is 
cleaved during packaging to the mature 
genome (not shown in Fig. 1). The cleav- 
age activity is in the C-terminal domain 



of the packaging ATPase. In recognition 
of the role of this protein in the second 
cleavage, the DNA packaging ATPase is 
also called terminase in the case of con- 
catemer-packaging phages. 7 " 11 

The various packaging terminases 
have recognizable sequence similarity, 
as shown via secondary structure pre- 
diction for some of them in Figure 2. 
This sequence and predicted secondary 
structure similarity has been extended 
to the packaging ATPase of phage phi29 
(Fig. 2), as previously discussed in a pre- 
liminary report. 44 The extension to phage 
phi29 was potentiated by (1) first, forming 
families of relatively close relatives and (2) 
then, probing among families. The align- 
ment of Figure 2 indicates that the phi29 
packaging ATPase has part of the endo- 
nuclease domain, which suggests that the 
endonuclease domain of terminases is part 
of the motor. Motor-participation of part 
of the endonuclease domain is also sug- 
gested by data for phage T4 (reviewed in 
ref. 39) and SPP1. 45 

The symmetry mismatch of the pack- 
aging ATPase/terminase rings is not uni- 
versal among phages. In the case of phage 
lambda, four subunits are in the terminase 
ring, together with 8 subunits of a DNA 
binding protein analogous to T3/T7 gp 
18. 46 One of us (PS) has previously pro- 
posed the hypothesis that connector/ 
ATPase symmetry mismatch, when it 
exists, was selected for suppression of ther- 
mal noise-driven errors during packag- 
ing. 47 If so, this mismatch would become 
less of an advantage as the effects of ther- 
mal noise became reduced, possibly by 
averaging that is greater during propaga- 
tion in the laboratory than it is in the wild. 

Duality #1: 

(A) Reductive Evolution, 

(B) Accretive Evolution 

The existence of the cf>29 pRNA raises 
the question of the relative contributions 
of reductive and accretive evolution in 
generating the pRNA. A similar question 
can be asked about other aspects of virus 
assembly. We insert here a short section on 
this duality, before continuing with the 
main theme. 

Evidence in favor of reductive evolu- 
tion includes electron micrographs of 
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Figure 2. Alignment and secondary structure prediction for terminases and the phi29 packaging ATPase. The proteins were first aligned via sequence 
similarity, without consideration of the secondary structure. Predicted secondary structures (a-helix, red; (3-structure, blue) were then superimposed 
on the previous alignment. Triangles indicate that the alignment excluded some sequence, the length of which is proportional to the size of the 
triangle. The following are indicated at the top. (A) A region found similar to P-loop ATPases/helicases, for which the prototype of known, crystallogra- 
phy-based structure is the helicase, PcrA. 40 (B) A region of predicted secondary structure found to be variable among the different terminases and that 
is thought to be flexible and to act as a hinge between the ATPase and endonuclease domains. (C) A region found similar to DNA branch-resolving en- 
donucleases, for which the prototype of known, crystallography-determined structure is RuvC/"" 43 The regions at the two ends of the terminases are 
missing in the phi29 packaging ATPase. These regions of terminases are essential for the binding of a terminase to (1) the connector, in the case of the 
C-terminal region, as discussed in the text, and (2) a smaller protein (gp18 for T3/T7, as illustrated in red in Fig. 1A and B), in the case of the N-terminal 
region. The smaller protein is an accessory protein that initiates DNA packaging (reviewed in refs. 7-1 0). The genes for the various terminases are the 
following: lambda, A; phi29, 16; T4, 17;T3/T7, 19; SPP1, 2. 



thin sections of eukaryotic cells infected 
by some viruses. These observations sug- 
gest that some viruses have arisen from 
cells that, first, became parasitic on other 
cells and, then, progressively lost (and are 
losing) functions. 48 ' 49 The key observa- 
tion is subcellular partitioning such that 
a residual of a virus-precursor cell appears 
to remain as a "virus factory." 50 Although 
"virus factory" is not precisely defined, 
these data are hard to explain without 
reductive evolution. 48 ' 49 

Other evidence is in favor of the pro- 
posal that accretive evolution is also a major 
theme. In relation to cells, viruses give, as 
well as take, benefits. Specifically, viruses 
give benefits by horizontal transfer of 
genes, a well-established phenomenon. 51-53 
The genes transferred from viruses are 
possibly more evolved than cellular genes 
for the same purpose, given that viruses 
replicate more rapidly than cells. Thus, 
in theory, selective pressure on microbial 
communities (as a whole) can lead to 
the acquisition of genes by viruses. 54 In 
practice, the larger viruses have genes for 
metabolism that (1) are functional, based 
on absence of sequence defects, but that 



(2) are in the category of genes that are 
provided by cells. Therefore, in this case, 
accretive evolution is likely. 54,55 

In the case of the 4>29 DNA packaging 
ATPase, the presence of a partial endonu- 
clease domain suggests that (1) an ancient 
ancestor had both a complete endonucle- 
ase domain and a concatemeric packaging 
substrate, and (2) the C-terminal region 
of this ancestor ATPase was lost when 
the packaging substrate was no longer a 
concatemer. If so, during the loss of endo- 
nuclease activity, the packaging ATPase 
may have become too small to provide a 
framework for its participation in a motor. 

Thus, a reasonable proposal is that 
(1) an RNA precursor to the pRNA ini- 
tially assisted a terminase and (2) as the 
terminase was shortened, the precursor 
RNA co-evolved to form a pRNA "pro- 
tein segment-substitute" that joined pack- 
aging ATPase to connector. 44 Empirically, 
(1) the pRNA does form the link between 
the packaging ATPase and the connec- 
tor, 39,56,57 as does the C-terminal region 
of terminases, 11,58,59 and (2) the function 
of an in vitro-essential domain of the 
pRNA (domain I; reviewed in ref. 39) is 



insensitive to change in sequence, if the 
approximate secondary structure and total 
length is maintained. 60 A second domain 
(domain II) is not necessary in vitro, 
although it is necessary in vivo based on 
the observation that all phi29 relatives 
have domain II (reviewed in ref. 39). 

These observations favor (but do not 
prove) reductive evolution to produce 
the 4>29 pRNA. One can only wonder 
how many of the ever-present, small cel- 
lular RNAs are at some stage of acting as 
a protein segment-substitute that evolved 
because the protein (1) had originally 
been selected for more than one function, 
and (2) had a needed function compro- 
mised when the protein changed because 
another of the original functions was no 
longer needed. 

Duality #2: Sub-Assembly 
(A) Rotation, (B) Stasis 

Rotation of a connector initially appeared 
to be an essential feature of DNA pack- 
aging, for several reasons. 61 " 63 Absence 
of rotation, however, was still a possibil- 
ity 26 and was the other half of duality #2. 
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Figure 3. Asymmetric 3D reconstruction of phage T7. (A) The entire phage is viewed perpendicular to the axis of the tail/core, at 20 A resolution. 
(B) The tail is viewed parallel to its axis. This 3D reconstruction is the unpublished work of Weimin Wu, Philip Serwer and Wen Jiang. 



However, a bulky group attached to the 
phage T4 connector did not inhibit pack- 
aging in vivo 64 and (2) the fluorescence 
of a dye attached to the 4>29 connector at 
any of several locations did not undergo 
the anisotropy decay expected for rotation 
of a connector that is linked to packag- 
ing in vitro. In the latter case, the pack- 
aging was monitored by nanometry of 
a bead attached to the distal end of the 
DNA molecule being packaged. 65 Thus, 
a connector "no rotation conclusion" was 
drawn. 

One prediction based on the connector 
no rotation conclusion is that the connec- 
tor has a unique orientation in relation to 
the tail and shell of the mature phage par- 
ticle, in spite of a shell/connector symme- 
try mismatch. In the case of phage T7, the 
accuracy of this prediction has been tested 
by 3D reconstruction from cryo-electron 
micrographs with no symmetry assumed 
during the reconstruction (asymmetric 
3D reconstruction). The result confirms 
the prediction in that the 6-fold tail, 



connector and tail fibers are reconstructed 
with high definition and have a unique 
orientation in relation to the locally 5-fold 
shell, in spite of the 5-fold/6-fold sym- 
metry mismatch (Fig. 3). Thus, we will 
make the assumption that the connector 
no rotation conclusion is correct, below. 

However, this conclusion does not 
imply that the DNA molecule does not 
rotate during packaging. Specifically, the 
DNA molecule might rotate without con- 
nector rotation, perhaps relatively slowly 
to either (1) avoid tangling during con- 
densation or (2) assist packing to the high 
final density. Such rotation has been as 
proposed by several authors as one pos- 
sible explanation of differences between 
molecular dynamics-predicted simulation 
and observed patterns of packaged DNA 
knotting. 66 ' 67 

The possibility of packaging-associ- 
ated DNA rotation raises the question of 
how rotation might be controlled. In the 
case of phages T3 and T7, a gpl4/15/l6 
roughly cylindrical, multi-layer 



structure, sometimes called a protein 
core, is attached to the shell-interior 
side of the connector (Fig. 1). This 
core might cause packaging-associated 
DNA rotation by either (1) directly driv- 
ing DNA rotation or (2) acting as a 
rotation-of-DNA biasing pure thermal 
ratchet. Both of these possibilities rely 
on mechanical energy from the (ATP 
cleavage-driven) motion of the DNA 
molecule during packaging. The DNA 
molecule can impart energy if the axis of 
the entry channel changes during packag- 
ing, thereby bending the DNA molecule 
and causing some packaging energy to 
be diverted to the walls of the channel. 
Empirically, cryo-EM-based asymmet- 
ric 3D reconstruction of T7 capsid I has 
revealed that, as the distance from the con- 
nector increases along the core, the axis of 
the channel becomes progressively further 
from the axis of the connector. The offset 
reaches 7.3 A at the core tip (unpublished 
observations of Fei Guo, Philip Serwer 
and Wen Jiang) . 
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Duality #3: 
(A) Thermal Ratcheting, 
(B) Power Stroking 

As described in Integration of Some Basics 
with Analysis of Biological Systems, ther- 
mal ratcheting becomes less effective as the 
ratchet-distance increases. If we assume 
that all biological motors are pure thermal 
ratchets (one half of duality #3, in which 
the other half is a power stroke-based 
motor) and have both ratchet-distances and 
ratcheting particle sizes that are roughly 
the same, then the motors should produce 
roughly the same force (see Equation 3). 
However, DNA packaging motors can over- 
come retarding forces as high as 110 pN, 
as determined by introducing various 
values of F_ L during nanometry and extrap- 
olating to 100% packaging. 9 ' 68 ' 69 On the 
other hand, kinesin, dynein and myosin- 
based motors generate maximal forces 
20-50 x smaller than that. 1 " 3 ' 5,6 The sim- 
plest, most likely (not the only) interpre- 
tation of these observations is that DNA 
packaging motors are not pure thermal 
ratchets and have a power stroke that is 
more powerful than the power stroke of 
the other motors (see also Sup. Material, 
Section 4). 

The question of a power stroke dur- 
ing phage phi29 DNA packaging was 
more definitively answered by higher- 
resolution nanometry of 4>29 DNA pack- 
aging. First, the packaging was found to 
be quantized. 71,72 Second, F L and v were 
independently measured for phi29 in vitro 
packaging slowed by methylation of a 
patch of 10 DNA phosphates. The result 
was incompatible with a pure thermal 
ratchet (see Equation 3). 72 That is to say, 
the phi29 DNA packaging motor has a 
power stroke when it undergoes a cycle in 
vitro in a system of purified components. 
The high forces of the phi29 motor also 
occur for the DNA packaging motors of 
phages lambda 73 and T4. 74 In addition, T4 
DNA is subjected to enough packaging- 
associated force to shorten a motor-asso- 
ciated DNA segment, as judged by Forster 
resonance energy transfer (FRET). 75,76 
Parenthetically, this force was assumed to 
be generated between terminase and con- 
nector, 75,76 but might be generated between 
two parts of the connector. Thus, we will 
assume that the duality of this section is 



not a duality any more for phage packag- 
ing ATPases/terminases. That is to say, 
these enzymes work via a power stroke. 

Duality #4: 
Packaging ATPase as 
(A) Complete Motor, 
(B) Spark Plug 

The energy transduction pathway of phage 
DNA packaging is the source of another 
duality: all-ATPase motor vs. ATPase as a 
spark plug-like component of a more com- 
plex motor. "ATPase-as-spark plug" was 
suggested by the effects of mutational alter- 
ation of several regions of the phage SPP1 
connector. Some mutations altered DNA 
packaging speed, while not altering binding 
of the ATPase to the procapsid-associated 
SPP1 connector. 77 X-ray crystallographic 
analysis of the structure of isolated connec- 
tors has shown that some of the packaging- 
slowing mutations are distal to the points 
at which the connector ring can contact a 
DNA molecule being packaged. 78 In addi- 
tion, cross-linking-inhibition of motion in 
the middle part of the connector reversibly 
inhibits DNA packaging. 79 

Thus, the authors of these studies have 
(reasonably) proposed that "cross-talk" 
occurs between connector and terminase 
during DNA packaging. Further, they 
have proposed a model in which the con- 
nector delivers the power stroke with a 
force transduction pathway that origi- 
nates in spark plug-like activity of the 
packaging ATPase. 78 As discussed below, 
the data do not support the power stroke- 
delivering connector region (tunnel loop; 
Duality #4, Continued) that was proposed 
in reference 78. But, connector-delivery of 
the power stroke is still a viable proposal, if 
this proposal includes the assumption that 
an alternative connector region delivers the 
power stroke. 

Other studies have suggested models in 
which the packaging ATPase cycle is the 
complete cycle, not simply the chemical 
energy-converting component of a more 
complex cycle. These "all-ATPase" models 
form the other half of duality #4, a dual- 
ity thus far unresolved. One proposed all- 
ATPase cycle is based on finding of two 
conformations of the packaging ATPase. 
These conformations were proposed to 
be the initial and final conformations of 



a power stroke that works via DNA push- 
ing and that depends on maintaining the 
DNA double helix and the firing ATPase 
subunit in phase. 80 Other all-ATPase 
models are designed to account for the 
nanometry-based findings of phi29 power 
strokes (1) that do not remain in phase 
with the DNA double helix and (2) that 
have four equal components. 71,81 These lat- 
ter models propose either a reach and pull 71 
or a push and roll 81 power stroke that does 
not require phasing with the double helix 
because the proposed power stroke works 
via steric DNA interaction. But, these lat- 
ter models generate some difficulties in 
maintaining procapsid-motor contact. 80 

Evidence to resolve this duality is incom- 
plete. On the one hand, an all-ATPase cycle 
is favored by the fact that motor ATPases 
for movement along myosin and actin (see 
DNA Packaging and Other Biological 
Motors and Integration of Some Basics 
with Analysis of Biological Systems) are 
known to work without any accessory 
protein resembling a connector. 1 " 6 On the 
other hand, phage DNA packaging motors 
are non-analogous to the eukaryotic 
motors discussed here in that (1) phage 
DNA packaging motors experience rela- 
tively high effects of thermal noise because 
they work as single motors, whereas the 
eukaryotic motors have noise suppres- 
sion generated by working in groups 47 and 
(2) the data for phage SPP1 77 " 79 and, more 
recently, for phage T4, 82 do indicate a con- 
nector/ATPase interaction. 

A progressively increasing load (next 
paragraph) suggests another reason for 
skepticism about the concept of an all- 
ATPase motor. An all-ATPase motor 
will be inefficient in that it has no obvi- 
ous mechanism for inter-cycle energy 
storage/release. In an ATPase-as-spark 
plug motor, the connector could, indepen- 
dently of the packaging ATPase, evolve to 
store/release chemical energy. This process 
would, flywheel-like, increase efficiency of 
energy usage, given the non-uniform force 
required for packaging (next paragraph). 

The average in vitro ATP usage was 
measured to be 1 per two base pairs 
packaged for phages phi29 83 and T3, 84 
although 4x greater for phage lambda. 85 
However, unlike the eukaryotic motors, 
DNA packaging motors have resistive 
force that systematically increases with 
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the fraction (F) of the track traversed, 
both in theory 86 and in practice (reviewed 
in ref. 9). Thus, uniform ATP-derived 
energy usage is wasteful at the beginning 
of packaging. Surprisingly, no measure- 
ment has been made of ATP usage vs. F. 
The variable, F, is the fraction of the DNA 
packaged in the case of DNA packaging. 

Based on the above discussion, high 
on the agenda for the future should be 
determination of ATP utilization vs. F. 
A potential start in this area is the find- 
ing that some phage lambda terminase 
mutants undergo increased in vitro stall- 
ing that does not spontaneously reverse. 
One of these mutants (T194M) (1) has a 
DNA packaging-associated ATP cleavage 
rate lowered by a factor of 0.008, while 
the nanometry determined packaging 
rate was lowered by factor of 0.14, i.e., 
18 x less. These numbers were apparently 
measured for F below 0.6. 87 The authors 
of reference 87 explain this discrepancy 
via potential (not demonstrated) techni- 
cal deficiencies, such as lowered packag- 
ing initiation and increased un-reversed 
stalling, during measurement of the ATP 
cleavage of the mutant. Un-reversed stall- 
ing was observed for both wild type and 
mutants. An alternative explanation is 
that ATP cleavage is not directly related 
to ATP usage, most dramatically at low F. 

Duality #4 (Continued): 
Details of a Possible Connector- 
Produced Power Stroke 

Assuming, for the moment, that the pack- 
aging ATPase is analogous to a spark plug, 
the following details were proposed for 
the delivery of the power stroke, based 
on work on phage SPP1. 78 (1) Disordered 
extensions (called tunnel loops) of all 12 
connector subunits occupy the channel 
simultaneously, along with the DNA mol- 
ecule. (2) The tunnel loops deliver power 
strokes sequentially, one-at-a-time. This 
proposal produced some difficulties in fit- 
ting all the tunnel loops in the channel and 
required that the tunnel loops made con- 
tact with the DNA molecule in different 
planes along the axis of the connector. 78 

One of the authors (PS) proposed 
another version of the connector-delivered 
power stroke, which was based on the 
occupation of the channel by extensions of 



three connector subunits at a time, rather 
than all 12. The proposed power stroke 
was delivered in four "mini-bursts," each 
mini-burst delivered by connector-exten- 
sions in a different set of three. 47 The four 
mini-bursts accounted for the fact that 
four steps per power stroke had previously 
been shown by nanometry for phi29 in 
vitro packaging. 71 (see also Sup. Material, 
Section 4). One of the model's predic- 
tions 47 was that three subunits at a time 
(only) occupy the channel of the connec- 
tor. A corollary is that the channel's con- 
ductivity can be altered by the presence of 
1, 2 or 3, but not more, subunits at a time. 
This prediction was subsequently found 
accurate via measurement of the conduc- 
tivity of phi29 connectors embedded in 
artificial membranes. 88 Another predic- 
tion, 47 also confirmed, 89 was that the con- 
nector is asymmetric in its resistance to 
DNA motion. 

In addition to the tunnel loop, which is 
at the center of the connector in the axial 
direction, a C-terminal region of disor- 
dered peptide projected into the channel 
of the connector of both the phi29 39 and 
SPP1 78 connectors. These regions are illus- 
trated in Figure 1C and D, which have 
skeletal drawings of the connector. The 
proposal of reference 47 was that either or 
both of these projections could deliver the 
power stroke. 

However, the following test produced 
results incompatible with delivery of the 
power stroke by the tunnel loops of the 
phi29 connector. First, the tunnel loop- 
encoding region was deleted from the 
gene that encodes the phi29 connector. 
Then, packaging without the tunnel loops 
was characterized and found (1) to go to 
completion and (2) to do so without nano- 
metry-measured decrease in force. 50 If one 
makes the assumption that the structure 
of the undeleted region of the connector 
is not changed by deletion of the tunnel 
loop-encoding region, this observation 
leaves only the C-terminal projection as 
a possible power stroke-delivering region 
of the connector. The latter possibility 
has apparently not been checked. The 
C-terminal region is the most mobile part 
of the connector (reviewed in ref. 39). 

Although the current consensus view 
is in favor of force delivery by the DNA 
packaging ATPase, this consensus view is 



not a fact. A reasonable alternative exists, 
with significant empirical support. Thus, 
the duality of this section is ongoing. 

Duality #5: Analysis via 
(A) Previously Isolated Intermediates, 
(B) New Intermediates 

Single-molecule analysis. Both nano- 
metry and other single-molecule-based 
procedures access some stages of DNA 
packaging while bypassing both elec- 
tron microscopy and isolation of most 
intermediates. The procapsid is the only 
intermediate that one must isolate for sin- 
gle-molecule-based procedures. Thus, two 
strategies now exist for directly access- 
ing the various states of DNA packaging 
motors. These two strategies form the 
basis for duality #5: single molecule-based 
procedures, which use previously isolated 
intermediates, and isolation/characteriza- 
tion-based procedures, which are based 
on isolating new intermediates. Single- 
molecule-based procedures are favored for 
simplicity of the biochemistry. 

Thus, the question arises of what, in 
theory, is needed beyond single-molecule- 
analysis. The answer from the devil's 
advocate is "nothing." In favor of this 
answer, already, ensemble-averaging stud- 
ies by FRET have revealed packaging 
associated (1) contraction (crunching) of 
the segment of a T4 DNA molecule that 
is lodged in the motor during in vitro 
DNA packaging 75 and (2) changes in the 
distance of DNA molecule to one region 
of the T4 terminase. 76 In a single molecule 
mode (via fluorescence microscopy), these 
and other motions of terminase/ATPase, 
DNA and connector can be visualized in 
the sequence in which they occur during a 
cycle of the motor. Potentially, one could 
design appropriate probes for any motion 
proposed to be that of a power stroke, with 
the design based on currently known struc- 
tures of T4 terminase (reviewed in ref. 80) 
and the consensus connector structure 
(reviewed in ref. 39). This strategy is lim- 
ited by the fact that current structural data 
do not include high-resolution structures 
of the entire motor complex. 

In addition, the single-molecule-based 
procedures (1) do not provide access 
to unanticipated aspects of dynamics, 
(2) reveal a relatively narrow spectrum of 
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information about the anticipated aspects 
of the dynamics and (3) do not reveal 
information about aspects of in vivo DNA 
packaging that are not aspects of in vitro 
DNA packaging. At least for now, isola- 
tion and characterization of intermediates 
will be needed to maintain a reasonable 
rate of progress. Nonetheless, we have a 
mild chicken/egg paradox with the isola- 
tion of intermediates. We do not know 
the characteristics of some intermediates 
because we have not yet isolated them. 
We have not yet isolated them, in part, 
because we have only limited knowledge 
about their characteristics. 

Analysis via fractionated intermediates 
produced in vivo: General principles and 
recent results. A way to begin resolution of 
this paradox is to base fractionations on the 
most general characteristics that intermedi- 
ates must have. This is done independently 
of what the more detailed (and generally 
unknown) characteristics of each inter- 
mediate are. For example, early studies of 
phi29 91 and T7 92 DNA packaging resulted 
in isolation of DNA packaging-generated 
capsids had incompletely packaged DNA 
(ipDNA), without an external DNA seg- 
ment. For both phi29 and T7, the external 
segment-free ipDNAs formed sharp bands 
(were quantized) when expelled from 
capsids and analyzed by agarose gel elec- 
trophoresis. But, the capsids with ipDNA 
(ipDNA-capsids) were not initially isolated 
in stable form and characterized. This situ- 
ation was resolved when phage T3 ipDNA- 
capsids generated in vivo were isolated and 
characterized, with the isolation based on 
the fundamentals of what their densities 
had to be. 

The following were densities expected 
after centrifugation in cesium chloride 
density gradients, with effects of pressure 
(relatively small) neglected. (1) Double- 
stranded DNA has a density of 1.7 g/ml. 93 

(2) Proteins vary in density with the per- 
centage of charged amino acids, but almost 
always have a density close to 1.3 g/ml. 94 

(3) T3 and T7 phages (DNA/protein ratio 
-1.0) have a density of -1.5 g/ml. Thus, 
T3 ipDNA-capsids will have densities 
between 1.3 g/ml and 1.5 g/ml. The most 
uncertain aspect is the stability of the 
ipDNA-capsids during fractionation. 

At least one general principle also 
appears to exist for the stabilizing of 



intermediates, including ipDNA-capsids. 
The packaged DNA state can be stabi- 
lized by "osmotic compression," whereby 
a capsid-impermeable compound is pres- 
ent outside of a capsid at a concentra- 
tion higher than inside. The result is an 
osmotic pressure gradient that blocks exit 
of packaged DNA. 95 ' 96 The first step in the 
purifying of T3 ipDNA-capsids has been 
precipitation with polyethylene glycol, 97 
which, in addition to causing precipita- 
tion, provides stabilization of ipDNA-cap- 
sids via osmotic compression. 

Some empiricism is needed with 
regard to other buffer components. For 
example, Mg 2t and other divalent cations 
destabilize the packaged DNA state of 
tail-free phage lambda capsids (heads). 98 
Nonetheless, Mg 2t stabilizes the packaged 
DNA state of the mature lambda phage 
(with tail). 99 Cesium chloride was found 
to dramatically increase the stability of T3 
ipDNA-capsids. 97 

In other areas also, one cannot elimi- 
nate trial-and-error in the isolation of 
DNA packaging intermediates. For exam- 
ple, the procapsid (capsid I) of the T3- and 
T7- related phage, phill, is so unstable 
that it converts to a capsid II-like par- 
ticle during ultracentrifugal purification. 
The phill procapsid was detected only 
by native agarose gel electrophoresis of a 
fresh, unfractionated lysate. 100 If all phage 
procapsids were this unstable, we would 
have obtained them (for single-molecule 
studies, for example) only if someone had 
made a substantial effort in the direction 
of stabilization. Similarly, the T3 ipDNA- 
capsids are more stable than their T7 
counterparts. 97 

The stability of in vivo-generated T3 
ipDNA-capsids enabled their fraction- 
ation by ipDNA length and subsequent 
analysis of structure. The structures 
were determined by cryo-EM, with 3D 
symmetric reconstruction of both the 
ipDNA and the capsid components of T3 
ipDNA-capsid II. These 3D reconstruc- 
tions provided a test for both analytical 
physics-based and computer modeling- 
based determination of the conformation 
of both ipDNAs and DNA packaged in 
the mature phage. 97 The following con- 
clusions were drawn from these data. (1) 
The ipDNA enters the capsid in a confor- 
mation indistinguishable from random. 



(2) The ipDNA forms a statistical ring at 
the inner surface of the outer shell, when 
an -2-ring amount of DNA is packaged. 

(3) Subsequently, additional packaged 
DNA rings form (see Fig. 3A) as more 
DNA is packaged. Conclusion (1) was also 
drawn via cryo-EM of in vitro-generated 
phi29 ipDNA-capsids; in this latter study, 
an ipDNA ring could not be detected 
because 3D reconstruction was not per- 
formed. 101 These studies, which resolved 
a previous duality of DNA conformation 
during packaging (reviewed in ref. 86), 
could not have been performed without 
isolating the intermediates, given current 
technology. 

As more intermediates are isolated and 
provide increasingly detailed snapshots 
of the various states of the DNA packag- 
ing motor, one expects to need increasing 
innovation to obtain additional detail 
via isolation of yet more intermediates. 
The reason is that "the lower hanging 
fruit" will have already been picked and 
the remaining intermediate states will be 
either (1) more transient, so that interme- 
diates are present in smaller amount, or 
(2) more unstable, so that the intermedi- 
ates are more easily lost, unless the motor 
is perturbed from its normal cycle. For 
stabilizing intermediates, genetics is used 
to perturb the cycle with the most flexibil- 
ity and specificity. The rapid/inexpensive 
character of phage propagation is a major 
asset for performing genetic perturbation. 

To bypass traditional fractionation 
procedures in the discovery of new inter- 
mediates, one possibility for the future is 
to (1) perform cryo-EM of fresh, unfrac- 
tionated lysates and (2) affinity-tag cap- 
sids and purify and concentrate capsids 
by using adherent support films to bind 
capsids from fresh lysates for cryo-EM. 
Feasibility in the case of phage T7 is 
shown in Figure 4. With this procedure, 
particles of different F are simultaneously 
observed. Importantly, intermediates are 
observed without the risk of fractionation- 
associated alteration. 

Analysis via Fractionated 
Intermediates: Some Past History 
and Future Goals 

The current understanding of DNA 
packaging is critically dependent on 
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Figure 4. Affinity cryo-EM of capsids and phage in an unfractionated lysate of phage T7-infected Escherichia coli.JUe host cells were infected by T7 
phage His-tagged at the C-terminus of gplO. Particles in a spontaneous lysate were partially purified and concentrated by adsorption to carbon- 
coated grids that had been coated with a Ni-NTA-containing lipid monolayer. Representative examples of capsid I, capsid II and phage particles are 
indicated (Unpublished work of F. Vago and W. Jiang). 



early investigations of procapsids, such as 
capsid I. The procapsids were observed 
biochemically by isolation and charac- 
terization. They were also identified by 
electron microscopy of thin sections of 
cells infected by, most dramatically, phage 
T4. 102 Biochemically speaking, the initial 
definitive experiment was performed for in 
vivo packaging of a T4 mutant with tem- 
perature-sensitive packaging. (1) Capsids 
were pulse-labeled with either 14 C (radioac- 
tive) or 15 N (in sufficient concentration to 
alter the density) at a temperature so high 
that packaging did not occur. (2) The tem- 
perature was then lowered and an excess of 
12 C and 14 N-substrates (i.e., chase labels) 
were added. (3) Incubation was continued 
to allow packaging to occur, sometimes 
with 3 H-labeling of DNA. (4) Tests were 
made for either 14 C, 3 H or density among 



phage particles formed after the chase. 
Both capsids and phage particles were 
isolated/identified by rate zonal centrifu- 
gation in a sucrose gradient. 103 

The experiments of reference 103 
revealed that, without disassembling/re- 
assembling, 14 C-labeled, pre-assembled 
capsids were filled with DNA after tem- 
perature downshift. Some DNA pack- 
aged in pre-temperature shift-assembled 
capsids was synthesized after lowering of 
temperature. 

In addition, procapsids were sub- 
sequently found to package DNA in 
cell-free in vitro systems that were ini- 
tially developed for T7 104 and lambda. 105 
In these studies, the packaging effi- 
ciency per procapsid did not dramati- 
cally decrease when the procapsids were 
diluted, which meant that the procapsids 



packaged DNA without dissociating and 
re-assembling. 

These results illustrate the rigor and, 
therefore, the beauty with which experi- 
ments can be performed when the inves- 
tigator has information about what the 
intermediates are. To learn the details of 
the program of DNA packaging motors, 
current objectives include answering 
the following questions via the isolation 
and characterization of DNA packaging 
intermediates. 

(1) What is the extent of packaging- 
associated shell dynamics after the well- 
established initial change 7 " 9 ' 39,106 of the 
shell (capsid I-to-capsid II transition of 
Fig. 1 for T3/T7)? We do have the fol- 
lowing evidence that post-capsid II shell 
dynamics occur in the case of T3/T7. 
Some in vivo-produced T3 ipDNA-capsids 



248 



Bacteriophage 



Volume 2 Issue 4 



have shells larger than the shell of the 
mature phage capsid, as seen by cryo-EM, 
assisted by native gel electrophoresis in 
two dimensions. 107 Some ipDNA-capsids 
have shells even smaller than the shell of 
capsid I, when stabilized by glutaralde- 
hyde cross-linking and analyzed by native 
gel electrophoresis in two dimensions. 108 

(2) What is the extent to which either 
DNA packaging or procapsid assembly 
causes accidental uptake of molecules 
other than DNA? A packaged DNA 
double helix occupies about 50% of the 
volume of the cavity in which it is pack- 
aged for at least phages lambda, P22 and 
T7 _io9,no Accidentally packaged non-DNA 
molecules possibly include RNAs and pep- 
tides present in infected cells (or impure 
in vitro packaging systems). Holes in the 
shell must develop during DNA packag- 
ing because packaging-associated expul- 
sion of either internal phage-encoded 
proteins or their proteolytic fragments 
usually occurs. 7 " 10 Thus, conditions for 
cytoplasmic molecule uptake exist. The 
total concentration of RNA and protein 
in an infected cell is 30-40%.'" If the 
fluid in the DNA-containing cavity of a 
mature phage particle had this concentra- 
tion, 65-70% of the total space would be 
occupied either by DNA or other, imbibed 
molecules. 

Electron microscopy of thin sections 
does provide some suggestive results for 
DNA packaging in phage T4-infected 
cells. These results were observations of 
expanded procapsids (analogous to what 
we call capsid II for T3/T7) that were par- 
tially full of DNA. The latter were called 
grizzled particles because of the irregular 
appearance of the ipDNA. The grizzled 
particles were, however, not visible unless 
steps were taken to leak proteins out of the 
infected cell, apparently because packaged 
host "cytoplasm" matched the surround- 
ing external cytoplasm. 102 This observa- 
tion suggests that packaging of non-DNA 
molecules does occur in vivo. 

(3) What is the linkage between pack- 
aging ATPase/terminase-derived ATP 
cleavage and power stroke? Only sug- 
gestive data currently exist. For example, 
phi29 nanometry has revealed the fol- 
lowing. (A) The time of a dwell phase of 
no DNA motion is decreased by increase 
of ATP concentration, but the time of a 



10 base pair "burst" of motion is inde- 
pendent of ATP concentration. 71 This 
observation suggests that ATP binding 
occurs only during a dwell 71 and pre- 
sumably causes the dwell, as recently 
also found biochemically for the phi29 
packaging ATPase. 112 The proposal has 
been made that ATP-induced DNA bind- 
ing is a thermal ratchet-like precursor to 
a power stroke, 47 and that this binding 
evolved from a more primitive thermal 
ratchet. 15 (B) The width of the dwell time 
distribution at low and saturating ATP 
concentration led to the conclusion that 
at least two ATP-binding and four non- 
binding, ATP-dependent transitions are 
involved. 71 (C) Increasing the packaging- 
resisting force increases dwell time to the 
extent that 2.5 base pair "mini-bursts" 
were observed within each 10 base pair 
burst. The packaging length of a mini- 
burst is incompatible with maintaining 
a constant relationship with any feature 
of a DNA double helix, thereby suggest- 
ing that the power stroke is delivered 
through steric interaction with the DNA 
molecule. 71 This conclusion is confirmed 
by the packaging of a variety of different 
polymers inserted in a DNA duplex. 72,113 
(4) Translocation is coupled to post-bind- 
ing ATP hydrolysis, based on the response 
to changing of the concentrations of ADP 
and phosphate. 9,71,114 

However, the above data do not show 
that, at all (or even any) values of F, all 
(or even most) ATP molecules bound 
during a power stroke are subsequently 
hydrolyzed during the same power stroke. 
If this were the case, then, as mentioned 
in Duality #4, inefficiency is introduced. 
Alternatively, either an ATPase-as-spark- 
plug model (refs. 47 and 78, for example) 
or a more complex-than-previously-pro- 
posed all ATPase model can accommo- 
date the possibility that only some of the 
bound ATP molecules are cleaved during 
each power stroke. In this case, the power 
stroke-associated cleavage probability per 
bound ATP molecule would increase as F 
(and packaging-resisting force) increased 
and more energy was needed. For an 
ATPase-as-spark-plug model, control of 
this probability would occur via inter- 
action with the connector, which stores 
energy between power strokes. We note 
that, if some of the nanometry-detected, 



ATP-dependent, non-binding events 71 
are ATP cleavages, feedback regulated 
ATP cleavage of this type will produce 
sharpening of the probability vs. dwell 
time distribution (Fig. 2A in ref. 71) as 
F increases. 

As well summarized in reference 9, 
an additional constraint on hypotheses is 
the finding that the total time to package 
a genome is approximately invariant for 
genome lengths between 19 Kb (phi29) 
and 170 Kb (T4). One might extend this 
invariance to all phages that form plaques 
either as quickly or more quickly than 
phage T4, including phage G, with a 600 
Kb genome. One wonders whether the 
phage G packaging ATPase really cleaves 
ATP at 30x the rate of the phage phi29 
packaging ATPase, as required by current 
packaging ATPase-as-motor models, but 
not required of an ATPase-as-spark plug 
model with a connector with flywheel-like 
function. 

To obtain more information, one strat- 
egy is to isolate and characterize ipDNA- 
capsids that have ipDNA engaged with the 
motor at various stages of a power stroke. 
If one fractionates these ipDNA-capsids 
by stage of power stroke, one could obtain 
a series of "snapshots" of the various stages 
of a power stroke. In theory, the structural 
details of each snapshot can be obtained 
by cryo-EM with 3D, asymmetric recon- 
struction, in analogy with what was done'' 7 
for determining ipDNA conformation 
vs. F with symmetric 3D reconstruction. 
Achieving success in fractionation of this 
type appears daunting at first, especially 
since each needed intermediate is likely to 
represent only a minor fraction of the total 
capsids and ipDNA. 

However, success becomes a plausible 
outcome if one can find a "probe" com- 
pound that (1) enters ipDNA-capsids at 
a rate that depends on the state of the 
associated motor and (2) either forms a 
density gradient for buoyant density cen- 
trifugation or modifies ipDNA-capsid 
density in a density gradient formed by 
another compound (see Fig. 5 and leg- 
end). An illustration of the basic prin- 
ciple is in Figure 11 in reference 115, 
with an iothalamate probe. Characteristic 
(1) requires that the largest hole in the 
shell is the channel of connector. The 
largest hole in T7 capsid II is, indeed, the 
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Figure 5. Permeation-based fractionation of ipDNA-capsids. (A) A compound to be used for 
buoyant density centrifugation (small circles; Metrizamide, for example) enters a DNA-free capsid 
via the channel of the connector. The shell is impermeable to this compound, as found 100115 for 
Metrizamide and the first post-capsid I version of T3/T7 capsid II. (B) The presence of a DNA 
molecule in the channel inhibits the entry of this compound. The inhibition can be complete, 
as pictured here, or it can be partial. If partial, the rate of entry will be dependent on where in 
the cycle of a power stroke the motor is. Thus, if the permeation does not go to completion (see 
Fig. 11 in ref. 115), the extent of permeation and, therefore, the capsid density will be controlled 
by the smallest channel in the connector, which, in turn, will be controlled by the stage of a power 
stroke at which the motor stopped. 



channel of the connector, based on the 
binding of dyes to a relatively imperme- 
able T7 capsid II." 6 The result of such a 
permeability-based fractionation would 
be density that depends primarily on the 
state of the motor and only secondarily on 
ipDNA length. Thus, this fractionation 
would yield a collection of fractions each 
of which represents a different state of the 
DNA packaging motor. 

Additional Points 
About Intermediates 

In any such study, each potential inter- 
mediate is tested to determine whether it 
(1) was conducting productive packag- 
ing when the process of packaging was 
stopped and (2) has undergone significant 
subsequent changes during fractionation. 
The primary test for participation in 
productive DNA packaging is a kinetics- 
based experiment similar to the experi- 
ment described above for determining 
that, what is now called the T4 procapsid, 
was an intermediate of productive pack- 
aging. This point was once in question, 
in contrast to what one might think. For 
example, the lambda procapsid was once 
called petit lambda. 

When analyzing post-procapsid stages 
of DNA packaging, kinetic labeling 



identifies an intermediate via its time pro- 
file. The labeling of an intermediate first 
increases, as the intermediate is generated 
from its precursor and, then, decreases, as 
the intermediate generates its successor. In 
contrast, the labeling of an abortive end 
product increases progressively with time. 
The labeling of the productive end prod- 
uct (phage) also progressively increases 
with time. Examples are in references 92, 
100, 115. 

A second test is to determine whether 
a potential intermediate can re-establish 
DNA packaging in vitro. If it can, then the 
potential intermediate is likely (although 
not certain) to be an intermediate in vivo. 
This test was also described above for pro- 
capsids. Of course, failure of this test does 
not imply abortive end product status, 
because the intermediate may have been 
inactivated during isolation, possibly in a 
trivial way. 

A third test is to perform in vitro real 
time, single-molecule observation of DNA 
packaging while probing for the exis- 
tence of particle that has (1) a signature 
previously-identified to be the signature 
of a potential intermediate and (2) the 
appearance kinetics of an intermediate. 
The process observed must end by produc- 
ing a phage-like particle. Perhaps, the best 
signatures are fluorescent signatures that 



identify potential intermediates previously 
observed via isolation/characterization. 
The study of connector rotation in refer- 
ence 65 (Duality #2) is a study of basically 
this type, as is a previous single-molecule- 
based study that that found phage T7 
DNA packaging to be cooperative among 
the various capsids that package a single 
concatemer in vitro. 117 This third test 
integrates isolation/characterization with 
single-molecule studies and can eventu- 
ally, via fluorescence signatures, be used to 
analyze intermediates in vivo. Given the 
potential of the single-molecule compo- 
nent of the strategy, one looks for ways to 
more rapidly complete the isolation/char- 
acterization component. The isolation/ 
characterization component is limited by 
lack of synchrony among different DNA 
packaging events. 

One way to bypass this limitation is 
to fractionate intermediates by F, which 
opens the possibility of substituting F 
for time in the analysis of intermediate 
progression.' 7 In this case, a single frac- 
tionation can, in theory, provide a time 
progression, although without defining 
the magnitude of the time scale. This 
strategy has been used with ipDNA- 
capsids. 97 However, all ipDNA-capsids 
are inactive. The act of taking each 
"snapshot" blocks further advancement 
of packaging. Nonetheless, taking snap- 
shots is critically important, because it 
makes possible subsequent determination 
of the characteristics of the target of the 
snapshot. This uncertainty principle-like 
paradox will become downsized as snap- 
shot taking advances and becomes less of 
a perturbation. 

However, even with an altered ipDNA- 
containing particle, one still has a reason- 
able test for intermediate status, if one 
makes the following assumption. Any 
transition to abortive end product status 
becomes more likely as the forces involved 
increase, i.e., as F increases, before com- 
pletion of packaging. For example, as 
F approached 0.9, a sudden decrease in 
resisting force was observed during nano- 
metry of lambda DNA packaging. 9,73 This 
decrease did not typically occur at either 
smaller F or F = 1.0. This decrease in 
resisting force was attributed to formation 
of a broken-shell abortive end product, 9,73 
although hyper-expansion of the shell is 
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also a possibility (see Duality #6). If a 
potential intermediate does not progres- 
sively increase in amount as F increases 
for F < 1.0, then the test for intermedi- 
ate status is positive, as in the case of T3 
ipDNA-capsids. 97 A subsequent test would 
be performed by use of real time single- 
molecule-based procedures with a signal 
for each potential intermediate derived 
from particles previously analyzed via 
fractionation and characterization. 

Duality #6: A Motor with 
(A) One cycle, (B) Two cycles 

The above sections focused on how the 
DNA packaging ATPase drives DNA 
packaging. The conventional assumption 
is that a packaging ATPase-derived cycle 
(with or without the connector in the 
power stroke) is the only cycle that pack- 
ages DNA. Support for this assumption 
is the observation by nanometry that an 
entire DNA molecule can be packaged 
without obvious change in the mechanism 
of force generation (reviewed in ref. 9). 
Empirically, no other reason apparently 
exists to make this assumption. The in 
vitro system used for these studies of 
phi29 was made of purified components. 
Duality #6 is derived from data, described 
below, that suggest the presence of a sec- 
ond, back-up cycle. 

A possible second cycle: Data. As 
in other fields, some data suggested a 
paradigm shift before serious attempts 
were made to interpret these data. 
"Quantized" packaging (revealed via 
band-forming ipDNAs) of phage phi29 
DNA was observed when the packaging 
was performed in vitro in unfraction- 
ated extracts of phi29-infected cells. 91 If 
one assumes that each ipDNA band is 
generated by a cycle of the motor, a cycle 
packages more than 100 base pairs, based 
on the separation of gel electrophoretic 
bands of ipDNA. However, the cycle 
of the packaging ATPase, as defined by 
nanometry, 71,72 occurs on a scale of 10 
base pairs. 

In addition, the sharp ipDNA bands 
observed for phage T7 after in vivo pack- 
aging could not be explained by sequence- 
specific cleavage, based on analysis via 
the T7 genomic DNA sequence. 92 T3/T7 
band separations were also not compatible 



with a 10 base pair cycle. The T3 separa- 
tions, in terms of DNA length, varied but 
were typically not more than 20% from 
1 Kb. Kinetics of appearance indicated 
that some quantized ipDNA-producing 
cleavages occurred in T7 DNA molecules 
that would have been further packaged if 
packaging had not been interrupted by 
investigator's sampling of infected cells. 
Other ipDNAs accumulated with time, 
as though generated by cleavage that had 
occurred before sampling. 92 

The sharpness and spacing of the 
bands suggest production of band-form- 
ing ipDNA via cleavage at a point of rela- 
tively slow packaging in a second cycle 
of the motor, i.e., a cycle in addition to 
the ATPase-dependent cycle detected by 
nanometry. A second cycle was not dis- 
cussed in references 91, 92. 

Further analysis was not performed 
until characterization of in vivo-produced 
ipDNA-capsids progressed. Specifically, 
F-fractionated T3 ipDNA-capsids were 
found to have ipDNAs some (not all) of 
which were quantized after expulsion 
from capsids; the remaining ipDNAs 
were broadly distributed. As F increased, 
an increase occurred in (1) the percentage 
of ipDNA that was quantized and (2) the 
percentage of (intact) ipDNA-capsids 
that did not form a band of capsid II, 
but were broadly distributed, when ana- 
lyzed by one-dimensional, native agarose 
gel electrophoresis (ld-AGE). Migration 
of ipDNA-capsids during native aga- 
rose gel electrophoresis is independent of 
ipDNA length, as long as the DNA is all 
packaged. In the case of one T3 deletion 
mutant (and for reasons not known), all 
ipDNA-capsids were broadly distributed 
during ld-AGE and all ipDNAs were 
quantized. That is to say, the quantized 
ipDNAs were released by the broadly dis- 
tributed ipDNA-capsids. 107 

The broadly distributed ipDNA- 
capsids were subsequently characterized 
via native, two-dimensional agarose gel 
electrophoresis (2d-AGE; recent reviews 
are in refs. 118-120). Both ld-AGE and 
2d-AGE fractionate particles only by two 
particle characteristics, (1) the per area 
surface electrical surface charge not neu- 
tralized by counter-ions (o~) and (2) the 
effective radius (R E )- Use of 2d-AGE sepa- 
rated effects of cr from effects of R„ and 

E 



revealed that the broadly distributed 
ipDNA-capsids varied in both o~ (nega- 
tive) and R E . Both R E and the magnitude 
of o~ were greater for these broadly distrib- 
uted ipDNA-capsids than for capsid II. 
However, some of the R's were smaller 

' E 

than the R E of even the procapsid when 
the ipDNA-capsids were cross-linked with 
glutaraldehyde. 108 

One interpretation of the above data is 
that the possible second cycle, discussed 
above, does exist and that this second 
cycle is a shell hyper-expansion/contrac- 
tion-producing cycle that does not always 
operate because it functions as a back-up 
cycle. According to this interpretation, 
one cause of un-reversed stalling is acci- 
dental presence of non-DNA molecules; 
the second cycle, in theory, 19 expels the 
non-DNA molecules. The back-up cycle 
(1) would be initiated by un-reversed stall- 
ing, such as seen for phage lambda (see 
Duality #4), (2) has the potential to cause 
selection for the encoding of an additional 
shell protein to maintain shell stability 
during hyper-expansion, and (3) might be 
triggered, in the case of phi29, by domain 
II of the packaging ATPase, a domain 
that is apparently needed only in vivo (see 
Duality #1). 

Some evidence supports this concept. 
Specifically, (1) the phage lambda shell 
decoration protein (D protein) is needed 
to completely package DNA, suggesting 
that a change in the shell is necessary to 
achieve an intermediate in a late stage 
of packaging and (2) quantized lambda 
ipDNA is produced when filled heads are 
destabilized by withdrawing the stabiliz- 
ing agents, ATP and ADP, as though the 
shell is the ruler for quantized ipDNA 
length. 121 In summary, evidence exists 
for a second cycle, thereby generating the 
duality of this section. Further discussion 
is in Supplemental Material, Section 5. 

Concluding 
Comments 

Studies of phage DNA packaging are of 
fundamental importance for understand- 
ing not only current aspects of biological 
motors, but also the abiotic and biotic 
ancestry of biological motors, viruses and 
cells. The most likely links to ancient 
ancestry are imprints that are ancient 
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in origin and that were retained in cur- 
rent phages because at least some of their 
selective advantages remained throughout 
history."' 122 ' 123 

Independently of ancestry, studies of 
phage DNA packaging are also of funda- 
mental importance for integrating physi- 
cal chemistry (for example, the various 
aspects of nanometry and cryo-EM), 
biochemistry (for example, the various 
aspects of both intermediate isolation 
and in vitro DNA packaging) and genetic 
perturbation of the process being studied. 
Phage genetics has long been the most 
advanced field of biochemical genetics, 
because of the relatively high speed of the 
replication of both phages and their hosts 
(reviewed in refs. 124 and 125). 

The following aspects of phage genet- 
ics have potential for serving as models 
for future advances of all genetics. (1) 
Phage mutants can be rapidly selected 
based on selection criteria that are likely 
to produce multiple mutations. An exam- 
ple is resistance to elevated sodium chlo- 
ride concentration in growth media. 108 
Multiple mutation-based genotype-phe- 
notype correlations are becoming pos- 
sible via next generation, high-throughput 
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